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Abstract 

This  paper  proposes  and  tests  a  model  of  the  diffusion  of 
innovation  in  the  railroad  industry.   Specifically,  the  diffusion  of 
steel  rails,  which  supplanted  iron  rails  in  American  railroads  in  the 
late  19th  century,  is  examined.   The  model  is  supported  by  empirical 
tests. 
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A  Study  in  the  Diffusion  of  Innovation: 
Steel  Rails  in  American  Railroads 


By  Jan  K.  Brueckner* 

A  number  of  articles  dealing  with  the  speed  of  adoption  of  new 
technology  by  firms  has  appeared  since  1960.   In  what  has  become  a  standard 
reference,  Mansfield  [4]  formulated  and  tested  a  model  of  the  spread  of  new 
technology  within  an  industry.   His  goal  was  to  relate  the  rate  of  adoption 
of  diesel  locomotives  in  American  railroads  to  firm  characteristics.   The 
goal  of, the  present  study  is  to  propose  and  test  another  model  of  the 
diffusion  of  a  specific  innovation  in  the  railroad  industry:   steel  rails  in 
American  railroads  between  1867  and  1880. 

Mansfield's  analysis  is  somewhat  aid  hoc  in  that  his  choice  of 
explanatory  variables  is  justified  only  by  casual  theoretical  arguments. 
One  virtue  of  this  paper  is  that  it  contains  a  formal  model  of  the  track 
replacement  process  which  clearly  indicates  the  importance  of  certain 
explanatory  variables.   The  success  of  the  empirical  test  of  the  model  suggests 
that  a  more  formal  approach  to  modelling  diffusion  processes  can  be  fruitful 
and  that  such  an  approach  could  be  successfully  incorporated  in  future  work 
in  this  area. 

In  1867  the  first  Bessemer  steel  rails  were  produced  in  the 
United  States,  and  steel  rapidly  supplanted  iron  in  American  railroad  track. 
By  1890,  80%  of  the  nation's  total  track  mileage  was  steel.   This  study  will 
attempt  to  relate  in  cross  section  the  proportion  of  a  railroad's  track  that 
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had  been  converted  to  steel  by  the  year  1880  to  a  number  of  variables  suggested 
by  a  theoretical  model  of  the  replacement  process.   The  year  1880  was  chosen 
because  the  U.S.  Census  of  Transportation  of  1880  contains  excellent  data 
on  variables  of  interest.   In  addition,  the  variation  in  the  proportion  of 
track,  that  had  been  converted  by  1880  is  large  across  railroads,  which  heightens 
the  empirical  challenge  any  theoretical  model  must  face. 

Section  I  reviews  some  historical  facts  and  develops  the  theore- 
tical model.   Section  II  contains  a  discussion  of  econometric  problems. 
Section  III  presents  the  empirical  results,  and  Section  IV  offers  concluding 

remarks. 

I. 

The  superior  durability  of  steels  rails  was  the  reason  for  their 
rapid  diffusion  in  American  railroads.   Contemporary  sources  contain  a  wealth 
of  information  about  the  lifespans  of  steel  and  iron  rails,  although  the 
numbers  vary  considerably  depending  on  the  source.   The  1900  Railroad 
Gazette  cites  15  million  tons  of  traffic  as  the  lifetime  capacity  of  iron 
rails,  while  claiming  that  steel  rails  could  handle  95  million  tons  before 
replacement.   One  source  [1]  claims  the  capacity  of  steel  was  100  to  250 
million  tons  depending  on  conditions  such  as  track  grade  and  curvature,  while 
yet  another  source  [9]  gives  125  million  tons  as  the  typical  capacity  of 
steel  rails.   Another  source  [7]  states  that  the  best  iron  rails  would  wear, 
on  the  average,  one- third  to  one-half  as  long  as  steel  rails.   But  he  also 
cites  the  experience  of  one  railroad  where  steel  rails  outlasted  13  sets 
of  iron  rails  with  no  perceptible  signs  of  wear,  and  claims  that  iron  rails 
lasted  only  four  months  on  a  line  between  New  York  and  Philadelphia  which 
experienced  heavy  traffic.   The  1900  Railroad  Gazette  states  that  a  consensus 
estimate  of  the  life  of  steel  rails  is  17  to  24.5  years,  while  another 
source  [9]  states  that  steel  lasted  30  to  40  times  as  long  as  average  iron 
rails.  While  no  single  verdict  emerges  from  the  sources,  it  is  clear  that 
steel  rails  outlasted  iron  rails  by  a  wide  margin,  with  the  lifespans  of 
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both  types  of  rails  depending  on  the  traffic  volume  per  period  on  the  line. 
These  facts  are  crucial  in  the  theoretical  model  which  is  developed  below. 

It  is  assumed  that  the  T  miles  of  iron  track  owned  by  a  given 
railroad  at  the  beginning  of  1867,  when  steel  first  became  available,  had  a 
uniform  age  distribution.   This  assumption  plays  an  important  role  in  the 
model,  but  there  is  no  way  of  easily  checking  it  from  available  sources.   If 
the  life  of  iron  rails  on  the  line  is  n  years,  where  n  is  an  integer,  then 
T/n  miles  come  up  for  replacement  in  each  of  the  years  numbered  zero  through 
n  -  1,  where  year  zero  corresponds  to  1867.   This  is  guaranteed  by  the 
uniform  age  distribution  assumption.  We  assume  that  n  -  1  <  13  =  r,  which 
says  that  all  original  track  fails  before  the  last  year  of  the  period  (year 
r  corresponds  to  1880).   This  assumption  seems  justified  in  the  light  of 
the  contemporary  accounts  cited  above.   It  is  also  assumed  that  n  is  constant 
across  the  period. 

The  next  important  assumption  is  that  when  track  fails,  the  rail- 
road replaces  it  with  steel  with  probability  p  and  with  iron  with  probability 
1  -  p.   The  determinants  of  this  probability  are  discussed  below.   Once  the 
track  is  replaced  with  steel  we  assume  it  will  not  fail  again  until  after 
1880,  an  assumption  which  seems  correct  given  contemporary  information. 
Ignoring  new  construction  for  the  moment,  it  will  be  true  that  the  nvimber  of 
steel  talles  owned  by  the  railroad  in  1880  is  the  sum  of  n  random  variables: 

Sn,  S,,  •••,  S   ,,  which  represent  the  number  of  steel  miles  in  1880  embodied 
0   1       n-1 

in  track  failing  initially  at  time  0,  1,  ...,  n  -  1.  We  now  examine  the 
properties  of  these  random  variables. 

Consider  the  ways  in  which  track  failing  at  time  zero  could 
become  steel  by  1880.   The  track  could  be  switched  to  steel  at  time  zero 
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with  probability  p  or  it  could  be  replaced  by  iron  at  time  zero  and  be  switched 

to  steel  at  its  next  failure  at  time  n  with  probability  (1  -  p)p»  or  it  could 

2 

be  replaced  with  steel  at  its  second  failure  with  probability  (1  -  p)  p, 

and  so  on.   The  number  of  opportunities  for  failure  before  the  end  of  the 
period  obviously  depends  on  the  relationship  between  n  and  r.   For  simplicity, 

suppose  r  =  6  instead  of  13  and  n  =»  2.   Track  failing  at  t  *  0  which  is 
always  replaced  by  iron  fails  again  at  t  =  2,  again  at  t  =  4,  and  again  at 
t  "  6.   It  experiences  3  =  r/n  failures  before  the  end  of  the  period.   If 
r  =  5,  only  two  secondary  failures  would  have  occurred  before  the  end  of 
the  period.   A  little  reflection  shows  that  the  number  of  secondary  failures 
of  track  initially  falling  at  t  =  0  and  always  replaced  by  iron  is  the  largest 
integer  less  than  or  equal  to  r/n,  orlnt(r/n).  Hence  the  probability  that 
track  falling  at  t  =  0  becomes  steel  by  t  =  r  is 


p  +  p(l  -  p)  +  . ..  +  p(l  -  p)i°t(r/n) ^  ^^j 


Similarly,  track  falling  initially  at  t  "  1  becomes  steel  by  t  «  r  with 

probability   P  +  p(l  -  p)  +  . . .  +  p(l  -  p)int((r-l)/n)  ^ 

In  general,  track  failing  at  t=  i<^n-  1  becomes  steel  by  t  ■»  r  with 

probability 

k^(l) 

p  I  (1  -  p)J  =  P(i.  n).  <2) 

3=0 

where  k'^(i)  =  lnt((r  -  i)/n).   The  expression  in  (2)  equals 

k"(i)  +  1 
1  -  (1  -  p)  "^       .  (3) 
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Thls  is  as  it  should  be,  since  the  probability  of  track  becoming  steel  is  one 

minus  the  probability  that  iron  is  always  used  for  replacement,  which  is 

k^(i)  +1 
(1  -  P)         .since  there  are  k  (i)  +  1  failures  if  iron  is  always  used. 

The  expected  value  of  the  random  variable  S.  is  then  P(i,  n) (T/n)  + 

(1  -  P(i,  n))  •  0  =  P(i,  n)(T/n),  and  the  expected  number  of  steel  miles  from 

replacement  of  all  original  iron  track  is'' 


T  ^-1 

-   Z   P(i,  n).  (4) 

i=0 


One  might  argue  that  this  model  is  unrealistic  because  it  allows  a  railroad 
to  replace,  for  example,  the  track  failing  at  t  =  0  with  steel  and  the  track 
failing  at  t  =  1  with  iron;  it  postulates  independence  of  different  replace- 
ment decisions.   While  this  argum.ent  carries  some  force,  it  is  not  ii!5)lausible 
that  replacement  decisions  for  different  track  segments  were  independent; 
steel  may  have  been  used  on  certain  sections  of  the  line  while  replacement 

continued  with  iron  on  other  segments. 

It  could  also  be  argued  that  a  model  with  a  constant  switchover 

probability  p  is  unrealistic.  The  spread  of  information  about  steel  rails 

and  the  decline  in  the  price  of  steel  relative  to  that  of  iron  might  have 

resulted  in  a  p  which  increased  over  the  period  1867-1880.  Unfortunately, 

it  is  not  possible  to  develop  empirical  implications  from  a  simple  model 

where  p  increases  over  time.^  However,  the  constant-p  model  is  defensible 

as  an  approximation,  especially  given  the  shortness  of  the  period  1867-1880. 
Many  railroads  constructed  new  track  over  the  period  1867-1880, 

and  the  previous  analysis  also  applies  to  this  new  construction.   The  expected 

number  of  steel  ndles  at  t  =  r  in  the  C(i)  miles  of  line  constructed  at 

period  i  is  C(i)P(i,  n) ,  assuming  the  probability  p  was  the  same  for  track 

replacement  and  new  construction.   The  expected  number  of  steel  miles  from 

all  new  construction  is 
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^Zq  C(i)P(±.  n). 


(5) 


Letting  ^Z^  C(i)  =  N,  the  expected  proportion  Q  of  steel  miles  In  all  track 
at  t  «=  r  is  derived  by  dividing  the  sum  of  (4)  and  (5)  by  T  +  N  =  A: 

^   n-1  r 

Q=—   Z   P(i,  n)+-   I        C(i)P(i,  n). 
"*  i=0  ^  1=0 


(6) 


Substituting  (3)  for  P(i,  n)  we  have 


f^        ^^T  ^~^  k  (i)    .        r 

Q  "  ^  -    nA     ^   (1-P)      -i^-^  Z  C(i)(l-  p)-r^-.   (7) 
i=0  i=0 


A±) 


Quantities  that  will  vary  across  firms  are  p  and  n,  and  we 
proceed  to  calculate  the  effects  of  changes  in  these  variables  on  Q.   Since 
k  (i)  >^  0,  it  is  immediately  apparent  that  3Q/3p  >  0;  the  expected  proportion 
of  steel  increases  as  the  probability  of  switch-over  from  iron  to  steel 
increases.   Since  n  is  an  integer,  we  examine  the  effect  of  unit  increases 
in  n.   From  (7) 


AQ 


_  (1  -  P)T 
A 


n-1 


,  .   .  k-Jd) 

i  Z   (1  -  p)  ^ 

i=0 


n  +  1 


j:  (1  -  p) 

i"0 


, n+1 . . . 
k^   (1) 


n    N   r  k"(i)         k""'"-'-(i) 

+    I  ^^   2   C(i)  [(1  -  p)  "^    -  (1  -  p)  ^     ], 
^    i=0 

k"(l)  k^+^i)   -  V^il) 

The  multiplicand  of  C(i)    is    (1  -  p)    '^        [1  -    (1  -  p)    ^  ^ 


]  ,  which 


(8) 


is  zero  or  negative  since  k   (1)  <  k  (i)  by  the  definition  of  k  and  since 
0  <  (1  -  p)  <  1.   This  is  true  because  (1  -  p)  raised  to  a  non-positive  power 
is  greater  than  or  equal  to  one.   Hence  the  second  term  in  (8)  is  zero  or 
negative.   The  first  term  in  (8)    has  the  same  sign  as 
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^  ,  n-1       k'^Ci)    n        k""*'^(i) 
^  Z   (1  -  P)  ^    -  ,   (1  -  p)  r 
i-0  1=0 

n-1       k°(i)     ,  kf  \i)  -  k^i)  kf  ^(n) 

i-0 

n-1       k^i)  kf  l(i)  -  k\i) 

-  E   (1  -  p)  '^   [1  -  (1  -  p)  "^        ""       ] 
i-0 

n-1  ,       k'^(i)  k°"^^(n)  -  k°(i) 


(9) 


+  z  ki-  ?)  ""     [1  -  (1  -  p)  "^  ""     ]. 


i-O  "" 


The  first  term  is  zero  or  negative  by  the  above  argument.   In  addition, 

k^'^-'-(n)  -  int((r  -  n)/(n  +]))  <.int  ((r  -  1) /n)  =  k^(i)  for  1  »  0.  1 n-1 

since  r  -  i  >  r  -  n  and  n  <  n  +  1.  Hence  the  second  term  is  zero  or  negative 
as  well,  and  we  have  established  AQ  <^  0;  an  increase  in  n,  the  life  of  the 
iron  rails,  decreases  or  leaves  unchanged  the  expected  proportion  of  the 
track  that  is  steel  at  t  =  r. 

If  we  let  T/A,  the  proportion  of  1880  track  built  by  1867,  equal 
q,  and  define  Z  =   N/(r  +  1)  rnd  letC(i)  =  ^  +  A(i),  then  (7)  may  be  written 

/I    \  n-1        k"(l)   /,    \/,    N   r        k  (1) 
Q  =  1  .  IL^  ,   (1  .  p)  r  ^  .  (1  -  P)(l  -  q)   ,   a  .  p)  r 


1=0  ""   "'     i«0 


k°(i) 


(10) 


/n    ,  ^  A(l),,    .  r'^^^ 
-  (1  -  p)  I    — V^(l  -  p) 


1-0   ^ 


It  may  be  shown  that  9Q/3q  is  ambiguous  in  sign.   If  C(i)  =  C  =  N/(r  +  1) , 
then  A(i)  =  0  for  all  i  and  the  last  term  in  (10.)  is  zero.  The  pattern  of 
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construction,   however,    tended  to  be  skewed  toward  the   early  part  of  the 

1867-1880  period   for  many   railroads.      Suppose  that   C(i)    assumes   a  high  constant 

value  in  the  early  part  of  the  period  and  drops  to  a  lower  constant  value  in 

the  latter  part  of  the  period.      In   other  words,   suppose  A(i)   "=  A^    >  0, 

i  =  0,    1,    ....    t,   and  A(i)   =  A     <   0,    i  =   t  4-  1,    . .  . ,    r.      Since 
r  ^ 

E     A(i)   «=  0,  we  must   have    (t  +   1)A     +   (r  -   t)A      =  0,   or  A-   =    (t  +  l)A,/(t  -   r) 
i-0  V  I  ^  i. 

aA   .      The   last   term  in   (10)    is   then  written 


-(1  -  p)A^ 
A 


t  k"(i)  r  k"(i) 

E  (1  -  p)    '^         +  a       E        (1  -  p)    '^ 

i"0 

^   ^  i=t+l 


(11) 


Suppose  t  +  1  =»  r  -  t.     Then  a  =  -1  and  the  term  in  brackets  in   (11)    is  zero  or 
negative  since  each  term  in  the  second  summation  is  not   less  than  any  term  in 
the  first    (k  (i)   is  nonincreasing  in  i   and  (1  -  p)    <  1)    and  there  are  equal 
numbers  of  terms  in  each  summation.      Letting  A    /A  =  D  we  have   9Q/3D  >^  0; 
holding  A  fixed,    the  more  skewed  the  construction  stream  is   toward   the  first 
half  of  the  period,    the  greater  is  Q.   In   addition,   increasing  A  while  holding 
A-    fixed   (and  holding  q  fixed   as  well)    decreases  Q.      Ift+l<r-t,    then 
a  >  -1,   and  since  the  suimnations  have  different  numbers  of  terms,  we  are 
xmable  to  say  anything  about   the  sign  of   (11)    and  hence  about   3Q/3D.      In 
this   case,   increasing  A      requires  decreasing  A„  by  an   amount  smaller  than  the 
increase  in  A. ,  but  the  decrease  in  A^  is   felt  over  a  larger  number  of  years 
than  is  the  increase  in  A    .      Wlien  a  =  -1,   A     falls  by   an  amount  equal     to 
the  increase  in  A     and  the  changes   are  spread  over  equal  numbers   of  years. 

So  far  we  have  seen   that  Q  depends   on  n,   p,   q,   and  D.     We  have  been 
able   to  evaluate  the  direction   of  influence  of  n,   p,   and  D  when  the  construction 
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flow  hatj  tY,3  e-nrr-  "i  at -i   ^^eatrrca.  What  remains  is  to  indicate  how  p  depends 
on  fxni.  parameters. 

Clearly,  firms  desirpd  to  switch  to  the  new  technology  because  of 
its  superiority;  p  is  an  inverse  measure  o^  the  inertia  restraining  firms 
from  adopting  the  innovation.   We  assicne  that  for  the  most  part,  firms  drew 
on  internal  funds  for  replacement  expenditures.   Since  steel  rails  were  more 
expensive  than  iron  rails^,  we   assume  that  the  greater  the  availability  of 
internal  funds  for  replacement  expenditures,  the  greater  the  probability  of 
the  switchover  from  iron  to  steel  for  a  fixed  cost  differential  between  steel 
and  iron  rails.   Similarly,  for  a  given  availability  of  internal  funds, 
the  higher  the  cost  differential  between  steel  and  iron  rails,  the  lower  the 
probability  of  switchover  from  iron  to  steel.   Since  mill  prices  for  rails 
will  not  vary  much  if  at  all  in  cross-section,  we  use  a  measure  of  differential 
shipping  distance  for  the  two  types  of  rails  as  a  measure  of  cost  difference. 
To  cover  the  possibil-lty  that  some  railroads  rorrowed  tc  finance  track  replace- 
ment, a  measure  of  borrowing  c^s  s  is  included  as  a  determinant  of  p.   Since 
the  difference  between  the  financing  cost  of  steel  and  iron  would  increase 
with  the  cost  of  borrowing,  we  assume  borrowing  cost  negatively  influences 
p.   Finally,  it  is  assumed  that  the  larger  the  railroad,  the  more  likely 
it  is  to  switch  to  eteel  from  iron.  It  is  likely  that  managers  were  better  in- 
formed in  larger  railroads  =ad  hence  more  willing  to  embrace  new  technology. 
In  addition,  if  ri inroads  ac  first  perceived  the  switch  from  iron  to  steel 
as  involving  risk,  the  likelihood   that  large  firms  are  less  risk  averse 
than  small  ones  alt^o  would  lead  to  a  positive  influence  of  size  on  p .  We 
measure  size  by  re/enue.   The  theoretical  model  thus  predicts  that^ 

Q  =  Q(n,  q,  D,  F,  C,  G,  R) ,  (12) 
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where  F  represents  the  availability  of  internal  funds,  C  measures  the  steel- 
iron  cost  differential,  G  is  borrowing  cost  and  R  is  revenue.   From  above, 
Q  decreases  or  is  constant  in  n.   We  have  shown  8Q/3q  $  0,  3Q/9D  >  0  if  A_^ 

is  properly  defined,  3Q/8F  >  0,  3Q/3C  <  0,  9Q/3G  <  0,  and  3Q/3R  >  0,  where 

the  latter  results  use  9Q/3p  >  0. 

II. 
The  actual  proportion  z  of  1880  track  that  had  been  switched  to 
steel  differs,  of  course,  from  the  expected  proportion.   Representing  the 
vector  of  explanatory  variables  in  (12)  by  x,  we  have 

z  "  Q(x)  +  u  ,  0.3) 


where  u  is  a  random  error  term,  the  difference  between  z  and  Q(x).   The  x 

X 

subscript  indicates  that  the  distribution  of  u  depends  on  x,  the  values  of 
the  explanatory  variables.   To  see  that  this  is  the  case,  consider  two 
railroads  with  C(i)  equal  to  zero  for  all  i  and  n  =  2.   The  random  variable  z 
can  assume  the  values  0,  h,      or  1  for  each  railroad.   Suppose  other  elements 
of  X  differ  between  the  railroads,  so  that  z  equals  0,  h,      or  1  with  pro- 
bability w  ,  y  ,  or  1  -  w^  ~  ^1 '  ^°^   railroad  one  and  equals  0,  %,   or  1  with 
probability  w  ,  y  ,  or  1  -  w„  -  y  for  railroad  two.   The  expected  values  of 
z  are  1  -  w  -  ^  y,   and  1  -  w„  -  %  y^   for  railroads  one  and  two  respectively. 
For  railroad  one,  the  error  term  u  assumes  the  value  -(1  -  w  ~  h  V-,)       with 
probability  w  ,  the  value  ^  ~    (1  -  w  -  h   y.)  =  -%  +  w  +  h  y..   with  probability 
y  and  the  value  1  -  (1  -  w  -  %  y  )  =  w  +  H  y^   with  probability  1  -  w  ~  Y-,  • 
For  railroad  two,  the  error  term  assumes  the  values  -(1  -  w  -  J5  y,) , 
-^  +  W2  +  ^  y,.  and  w  +  %  y  with  probabilities  w  ,  y  and  1  -  w  -  y 
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respectively.   This  example  shows  that  the  distribution  of  u  does  indeed 
depend  on  the  values  of  the  x  variables.   If  n  and  the  C(i)  not  been  the  same 
for  both  railroads,  the  set  of  possible  values  for  z  would  have  been  different 
for  each  railroad,  further  reason  for  the  dependence  of  the  distribution 
of  u  on  the  values  of  the  x  variables. 

The  next  step  is  to  choose  a  functional  form  to  approximate  Q(x) , 
which  is  highly  non-linear.  We  represent  Q(x)  by  the  logistic  function 
(1  +  e    )   .   Since  Q(x)  =(l+e)   +e,  where  e  is  the  approxima- 
tion  error  which  depends  on  x,  (13)  becomes 


(1  +  e"^''^  ^  +  s^,  (14) 


where  s  =  u  +  e  . 

XXX 

Since  we  do  not  observe  n,  the  length  of  life  of  the  rails,  we 
use   the  fact  that  n  =  n(B)  ,  where  B  is  a  measure  of  the  intensity  of  use  of 
the  track,  and  replace  n  by  B  in  the  vector  x.  "  B  negatively  influences  n, 
'-creases  in  B  tend  to  increase  Q. 

Transforming  (l4),  we  have 

.       ,^  .   a(x). 
/   \        1-z   (1  +  e^  ')s  ,,_. 

ga(x)  ^  ^ ^  (15) 

z  z 

Taking  logs   on  both  sides  of    (15)   and  expanding  the   log  of  the  LHS   around 
(1  -   z)/z  in  a  first-order  Taylor  series,   we  get   the  approximation 

1-z                                    (1  +  e"^^^)s 
log( )   =  a(x)   +  — -_— X 


s     -   e"^'^>(l  +  e'^<^>)-l 

(16) 
a(x)   +  V 


X 
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Slnce  the  distribution  of  the  error  term  v  depends  on  x  in  some 

complicated  way,  ordinary  least  squares  estimates  of  a  linear  or  log-linear 

a(x)  will  be  biased  and  inefficient.   However,  under  the  circumstances,  it 

appears  that  the  best  we  can  do  is  to  assume  that  the  error  term  in  (16)  is 

identically  distributed  across  the  sample  and  uncorrelated  with  x  and  compute 

ordinary  least  squares  estimates.   This  procedure  is,  of  course,  formally 

unjustifiable,  but  the  intractable  nature  of  the  estimation  problem  leaves  no 

alternative. 

Ill- 

Data  are   available   on  the  proportion  steel  of  track  mileage   ownel 
and   operated  by  the  railroads   in   1880.      Many  railroads,   however,   leased 
track   from  other   lines   for  their  own  use.      Our  measure   of  the   intensity   of 
use   of  the   owned  track,   which   is  discussed  below,   will  be  faulty  if   a  signi- 
ficant  fraction   of  the   operated  track  is   leased   from  other  railroads.      Thus, 
we  excluded  railroads   frran  the   sample  for  which   leased  track  in   1880   com- 
prised more   than  20%   of  total  mileage  operated.      The  great  majority  of   rail- 
roads  In  the  sample   leased  no   track  at   all,  while  the   average  proportion 
le-p  >(i   •'.mong  those  that  did   lease   track  was   about   10%  .  Other  criteria  fr  r 
Inclubion  in   the  sample  were   that   the   railroad  had   to  have   some   track  in 
operation   in   1867,    that  it  had  to  own  some  steel  track  in   1880,    so   that   the 
de^jend-    t  variable  is   defined,    and  that  it  had   to   lease  none   of   its    owned 
track   to  other  railroads   in   1880.      Forty-four   railroads  qualified  for  inclusl';n 
in  the  sample  under  these  criteria.      A  broad  range   of   firm  size  is   represe^*- 
in  the   sample. 

The  variables  used    in   the   regressions   are  discussed  next.     A  con- 
tfen?>orary  source    [1]    claims  that   the  best  measure   of   the  intensity  of  use 
of  railroad  track  is   the  number  of  trains   it  handles  per  period.      The  source 
states  that  the  product   of  train   tonnage   and   train  speed  is   a  good  proxy 
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for  the   track  wear  caused  by   a  train.      Since   freight    trains   are  heavier  and 
slower  than  passenger  trains,    the  source   claims   that   the   above  product   is 
roughly   constant   across   trains,   meaning  that   the  number  of  trains  handled 
per  period  is   an   appropriate  measure   of  intensity   of  use.      Consider  the 
ratio  of  train  miles   per  year  to  miles   of  track  in   the  system.      If  each 
train  passes   over  the   entire   track  system,   this   ratio  equals   the  number  of 
trains   per  year  passing   over  each  mile   of  the   system,    the   correct  intensity   of 
use  measure.      The   regressions   use  two  variants  of  a  measure  based  on   this 
ratio.      Bl  is   the   average   of   the  ratio  in   1874  and  1880:      (TM74/M74  +  TM80 Z'^'? '. ,  ,  _ , 
where  M  is  miles    of   track  in   the  system  and  TM  is    train  miles.      B2   is 
computed  using  MR,   miles   of  road,    or  main   track,   excluding  sidings    and 
auxiliary  track:      B2  =    (TM74/MR74  +  TM80/MR80) /2 .      Since  road  miles   are 
more  heavily  used  than   sidings   and  auxiliary   track,   it  was   felt  this  was   a 
better  measure   of  use   intensity.      The   averaging  is  meant  to  capture  variations 
In  use  Intensity  over  the  period.     T^ile   the   regression   results   using  Bl 
and  E2   are  very  similar,   B2   performs   slightly  better  than  Bl.      All  reported 
results   are   from  regressions   using  B2. 

The  variable  q   is   computed  as    (M80-BLT) /M80,   where  BLT  is   total 
coostruction  over  the  period.      Unfortunately,    construction  data  are  for 
miles   of  road  only,   excluding  sidings   and  auxiliary  track.      Thus   the   computed 
q  value   overstates   the   true  q  value  somewhat. 

Three  versions   of  the  A     variable  were   computed:      they  are  the 
amounts  by  which   the   average  number  of  miles   of   construction  per  year  in 
the  subperlods   1867-73,    1867-71,    and  1867-70  exceed  the   1867-1880  yearly 
average.      The  first   variant  corresponds    to  the  a  =  -1  case  above,  whereas 
the   latter  two   correspond  to  the   ambiguous   cases  where  a  >  -1.      It  should 
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be  noted  that  construction  expenditures  were  typically  uneven  instead  of  conr 

in  subperiods  as  the  model  requires.   Nevertheless,  we  assume  construction  expenu^ 

tures  were  spread  evenly  over  subperiods  of  interest  in  computing  the  A  measures 

of  the  skewness  of  the  construction  flow.   Below,  we  present  results  using 

D  (=  A  /M80)  values  based  on  the  first  and  third  versions  of  A   (the  results  with 

the  second  version  have  insignificant  estimates  of  the  D  coefficient) 

Our  measure  of  the  adequacy  of  internal  funds  for  replacement 
expenditures  is  based  on  net  revenue  per  mile  of  track  in  the  system.   A 
given  amount  of  net  revenue  is  less  adequate  for  replacement  expenditure  in 
a  large  than  in  a  small  system,  so  we  deflate  net  revenue  by  miles  of  track. 
The  variable  is  NRJI  =  (NR74/M74  +  WR80/M80)/2,  where  NR  is  net  revenue. 
The  revenue  measure  of  firm  size  is  R  =  (R74  +  R80) /2 ,  where  R74  and  R80 
are  gross  operating  revenues . 

The  measure  of  the  cost  differential  between  steel  and  iron  rails 
is  essentially  a  measure  of  differential  distance  between  the  iron  and  steel 
mills  and  the  railroad's  line.   The  variable  was  set  equal  to  unity  plus- 
the  smallest  number  of  states  separating  a  state  In  v;hlch  the  railroad 
operated  from  a  state  producing  steel  rails  minus  the  smallest  number  of 
states  separating  a  state  in  which  the  railroad  operated  from  a  state 
producing  iron  rails.   For  example,  if  steel  rails  were  produced  two  states 
distant  while  iron  rails  were  produced  in  a  state  adjacent  to  a  state  of 
operations,  the  variable  assumed  the  value  1+  (1^  0)  =  2.   The  variable 
was  computed  for  1870  and  1880  and  an  average  was  taken  and  denoted  LOG. 

As  a  measure  of  the  cost  of  borrowing,  we  used  the  highest  interest 
rate  on  those  railroad  bonds  outstanding  in  either  1867,  1874,  or  1880, 
denoted  BY.   The  rationale  was  that  raising  money  in  the  bond  market  for 
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replacement  purposes  would  have  required  high  yield  bonds  due  to  collateral 
problems.   This  is  admittedly  a  crude  measure  of  the  borrowing  cost  for 
replacement.' 

The  model  was  estimated  in  linear  and  log-linear  form.   The  log- 
linear  regressions  produced  higher  t-ratios  than  the  linear 
regressions ,  so  only  the  former  results  are  reported.   The  a  priori  signs 
for  the  coefficients  are  as  follows  (they  are  just  opposite  to  those  at  the 
end  of  Section  I,  given  the  logistic  formulation):   B2 ,  negative;  D  (first 
variant),  negative;  D  (third  variant),  uncertain;  q,  uncertain;  LOG,  positive; 
R,  negative;  NRM,  negative;  BY,  positive. 

The  estimates  from  fitting  equations  using  D  are  reported  in  the 
first  three  rows  of  Table  1.   In  the  equation  with  all  the  variables  included, 
the  coefficients  of  B2  and  R  are  significantly  negative,  as  expected,  at  the 
one-tailed  5%  level,  while  the  coefficient  of  LOG  is  significantly  positive, 
as  expected,  at  the  one-tailed  10%  level.   The  coefficient  of  q  is  not 
significantly  different  from  zero,  which  is  ccnsistent  with  its  uncertain 
a  priori  sign.   However,  the  coefficients  of  D  ,  NRM,  and  BY  are  also  not 

significantly  different  from  zero,  contradicting  a  priori  expectations. 

2 
The  equation's  R  seems  high  given  the  nature  of  the  dependent  variable. 

Kennedy  and  Bancroft  [3]  have  developed  a  sequential  variable 

exclusion  technique  for  model  selection.   Optional  variables,  those  which 

may  or  may  not  belong  in  a  model,  are  sequentially  excluded  in  reverse  order 

of  a  priori  iiiroortance  when  the  null  hypothesis  that  their  coefficients 

are  zero  cannot  be  rejected  at  a  given  confidence  level,  optimally  chosen  to  be 

.25  by  numerical  calculations  in  Kennedy  and  Bancroft.  After  each  exclusion, 

the  model  is  reestimated.   The  sequential  t-test  uses  the  variance  estimate  from 

fitting  the  full  model.   The  technique  is  based  on  the  assumption  of  normal 
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error  terms,    so   its  application  in  our  model   is  not  strictly   correct.      Thus, 
the  use  of    the   technique  below  must  be  viewed  with  some  reservation. 

The  variables   that   definitely  belong  in   our  model  are  B2,  q, 
and  D.    The   optional  variables   are  LOC,   NRM,   R,   BY,  x<?ith  BY  being  the   least 
important.      BY  may  be   deleted  according  to   the  Kennedy-Bancroft   criterion 
(the  critical  points   are   roughly  +  1.15),    and  reeatimatlon   results   in   the 
estimates   on   the  second     row   of  Table   1.      The   only  major   change   is   that 
the   coefficient   of  D     is  now  significantly  negative,   as   it  should  be,   at   the 
10%   one-tailed   level.      It   should  be  noted  that  BY  may   also  be   deleted  according 
to  Sawa  and  Hiromatsu's  minimax  regret   criterion    [6],  which  sets   the  critical 
values   at  +  1,371. 

If  we  believe   that  NRM  is    less   Important  than  R,    then   the   exclu- 
sion  criterion   calls   for  its  deletion   as  well.      Reestimation  results   In  row 
3  of  Table   1;    the   coefficient  estimates   for  the  remaining  variables   are 
esentially  unchanged  from     rov7   2.      Since   the  argument   for  including  NRM 
among  the   determinants   of  the  probability  p  is  more  persuasive   than  that   for 
inclu— ag   R,    dtL^ting  NRM  is   probably  undesirable.      Since  we   are   able   to 
reject   the  null  hypothesis  that  R's   coefficient  is   zero   (using  the  variance 
estimate  from  the   full  model  to  make   the  test)  ,   the  exclusion  procedure  should 
stop   at     row   2. 

The   last   three     rows    of  Table   1  give  estimates   of  the  model  para- 

2 
meters  using  D   .      Tue  R     is   somewhat  higher  in   line   4  than  in   line   1,   LOC' 3 

significance   level  improves   to  5%,   and  D  becomes   significantly  negative  at 

the  5%   level.      He  may  not  exclude  EY  by  the  Kennedy-Bancroft   criterion, 

although  Sawa's   criterion   allows   its  deletion.      From  line  5,   reestimation 
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without  BY  lowers  LOC's   and  R's  coefficients   significance   level  to  10% 

2 
and  reduces   the  R     somewhat.      As  before,    deleting  NRM  leaves   the  estimates 

of   the   remaining  coefficients  essentially  unchanged. 

IV. 

The  empirical  results  in  Table  I  show  that  the  model  developed 
in  Section  I  is  not  inconsistent  with  the  data.   The  coefficient  of  the 
important  intensity  of  use  variable  always  has  the  correct  sign  and  is  strongly 
significant  in  each  equation.   The  q  variable  belongs  in  the  estimating 
equation"  according  to  model,  but  its  coefficient's  lack  of  significance 
confonns  nicely  to  the  model's  ambiguity  about  its  sign.   The  D^  variable 
performs  reasonably  well  in  the  first  three  eqxiations ,  especially  after  the 
deletion  of  BY.   Among  the  optional  variables,  LOG  and  R  are  always  signifi- 
cant with  the  correct  signs.   It  is  heartening  that  the  only  variables 
which  do  not  perform  well,  BY  and  NRM,  are  optional  variables  whose  inclusion 
in  the  model  was  discretionary. 

The  fact  that  the  coefficient  of  the  D  variable  is  significantly 
negative  was  not  predicted  by  the  model.   However  since  some  measure  of  the 
skewness  of  the  construction  flow  belongs  in  the  estimating  equation  according 
to  themodel,  the  good  performance  of  D-  is  not  surprising. 

The  empirical  results  in  this  paper  are  quite  encouraging, 
suggesting  that  a  complex  microeconomic  model  of  the  track  replacement  process 
generates  predictions  which  conform  well  to  the  historical  facts.   The 
empirical  success  of  realistic,  detailed  economic  models  is  always  a  happy 
occurence,  increasing  our  confidence  in  economic  theory.   Of  course,  one 
could  argue  that  a  completely  ad  hoc  model  of  the  diffusion  of  steel  rails 
could  lead  to  estimating  equations  similar  to  those  in  this  paper.   For  instance. 
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casual  theoretical  argiments  could  be  advanced  supporting  the  inclusion 
of  an  intensity  of  use  measure  such  as  B2  in  any  estimating  equation.   How- 
ever, it  is  unlikely  that  an  ad  hoc  model  would  lead  to  a  specification  which 
includes  the  variables  q  and  D,  which  measure  the  fraction  of  1880  track 
existing  in  1867  and  the  skevmess  of  the  construction  flow,  respectively. 
Thus,  the  empirical  model  specification  in  this  paper  is  not  likely  to  be 
deducible  from  an  ad  hoc  theoretical  model,  and  we  can  be  confident  that  the 
success  of  the  empirical  tests  vindicates  our  theoretical  model  while  lending 
no  support  to  a  competing  naive  hypothesis.   In  conclusion,  we  note  that  the 
results  in  this  paper  argue  for  a  more  rigorous  approach  to  modeling  diffusion 
processes  than  has  characterized  past  work.  Rigorous  theoretical  models  may 
well  be  successful  in  explaining  the  spread  of  new  technology  in  a  variety  of 
industries  and  time  periods. 
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Footnotes 


See  Fishlow. 

^It  is  possible  to  formulate  the  model  with  a  continuous  instead  of  integer- 
valued  n.   Then  the  equivalent  of  (5)  is 

_  j   P(i,  n)di. 
n  J  0 
This  model  is  somewhat  less  transparent  than  the  discrete  one,  however. 

^Suppose  the  switchover  probability  grew  at  a  constant  annual  percentage 
rate  a:   the  probability  that  iron  \<ias   used  in  period  x  is  p(l  +  a)-  ■ 
The  probability  P (i,  n)  that  track  initially  failing  in  period  i  becomes 
steel  by  period  r  is 

k"(i)  +  l 

1-  %    [1  -  p(l  +  a)J"^'^). 
j=0 

The  expression  equivalent  to  (7),  with  C (i)  =  0  for  simplicity,  is 

n-  i   r 


Q  =  1  -  ^  'z     n   (l  -  p(l  +  ay^""^^]. 

"  i=0    j=0 

It  is  not  possible  to  show  an  unambiguous  effect  for  a  change  in  n,  as 
in  (8),  under  this  model. 

4 
See  Temin. 

Tit  could  be  argued  that  the  probability  of  switchover  should  also  depend  on 
the  degree  of  superiority  of  steel  over  iron  rails,  which  would  be  positively 
related  to  the  intensity  of  use  of  the  track.   Below,  n  in  (13)  will  be 
replaced  by  B,  a  measure  of  intensity  of  track  use.   Arguing  that  B  also 
positively  influences  p  does  nothing  to  change  the  specification  of  the 
model  or  the  a  priori  signs  of  estimated  coefficients. 


^ 


he  discrete  character  of  n  in  the  model  and  the  continuous  nature  of  B 
mean  that  n(B)  should  be  a  step  function.   However,  our  formulation  involves 
a  continuous  relationship  bet^v'een  S  and  the  value  of  the  logistic  function. 
The  error  term  e  is  presumed  to  smotth  over  this  discrepancy. 

'An  observation  on  3Y  was  not  available  for  one  small  railroad,  so  the  BY 
value  for  this  railroad  was  set  equal  to  the  average  of  the  BY  values  for 
two  other  small  firms. 
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